Gamma-Poisson Distribution Model for Text Categorization
نویسندگان
چکیده
منابع مشابه
Bayesian paradigm for analysing count data in longitudina studies using Poisson-generalized log-gamma model
In analyzing longitudinal data with counted responses, normal distribution is usually used for distribution of the random efffects. However, in some applications random effects may not be normally distributed. Misspecification of this distribution may cause reduction of efficiency of estimators. In this paper, a generalized log-gamma distribution is used for the random effects which includes th...
متن کاملWeighted Kernel Model For Text Categorization
Traditional bag-of-words model and recent wordsequence kernel are two well-known techniques in the field of text categorization. Bag-of-words representation neglects the word order, which could result in less computation accuracy for some types of documents. Word-sequence kernel takes into account word order, but does not include all information of the word frequency. A weighted kernel model th...
متن کاملOn the compound Poisson-gamma distribution
The compound Poisson-gamma variable is the sum of a random sample from a gamma distribution with sample size an independent Poisson random variable. It has received wide ranging applications. In this note, we give an account of its mathematical properties including estimation procedures by the methods of moments and maximum likelihood. Most of the properties given are hitherto unknown.
متن کاملTest Model for Text Categorization and Text Summarization
Abstract—Text Categorization is the task of automatically sorting a set of documents into categories from a predefined set and Text Summarization is a brief and accurate representation of input text such that the output covers the most important concepts of the source in a condensed manner. Document Summarization is an emerging technique for understanding the main purpose of any kind of documen...
متن کاملThe Infinite Gamma-Poisson Feature Model
We present a probability distribution over non-negative integer valued matrices with possibly an infinite number of columns. We also derive a stochastic process that reproduces this distribution over equivalence classes. This model can play the role of the prior in nonparametric Bayesian learning scenarios where multiple latent features are associated with the observed data and each feature can...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ISRN Artificial Intelligence
سال: 2013
ISSN: 2090-7443
DOI: 10.1155/2013/829630